Study of the average size of Glushkov and Partial Derivative Automata
Abstract
This paper studies the relation between the Glushkov automaton (Apos) and the partial derivative automaton (Apd) of a given regular expression in terms of transition complexity. Nicaud proved that the average transition complexity of Apos is linear in the size of the corresponding expression, a result obtained from an upper bound on the number of transitions of Apos. Here we present a new quadratic construction of Apos that leads to a more elegant and straightforward implementation and allows the number of transitions to be counted exactly. Based on that, a better estimate of the average size is presented. Asymptotically, and as the alphabet size grows, the number of transitions per state is on average 2. Broda et al. computed an upper bound for the ratio of the number of states of Apd to the number of states of Apos, which is about 1/2 for large alphabet sizes. Here we show how to obtain an upper bound for the number of transitions in Apd, which we then use to get an average-case approximation. Some experimental results are presented that illustrate the quality of our estimate.
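To make the two automata in the abstract concrete, the following is a minimal Python sketch that counts states and transitions of Apos and Apd for one small expression. It uses the standard textbook constructions (first/last/follow sets for the Glushkov automaton, Antimirov-style partial derivatives for Apd), not the new quadratic construction the paper describes. The tuple encoding of expressions and all function names (`glushkov_size`, `pd_size`, and so on) are assumptions made for illustration only.

```python
from itertools import count

# Hypothetical encoding of regular expressions as nested tuples:
# ("eps",), ("sym", a), ("alt", l, r), ("cat", l, r), ("star", e)
EPS = ("eps",)

def nullable(e):
    """True if the language of e contains the empty word."""
    kind = e[0]
    if kind in ("eps", "star"):
        return True
    if kind == "sym":
        return False
    if kind == "alt":
        return nullable(e[1]) or nullable(e[2])
    return nullable(e[1]) and nullable(e[2])  # cat

def glushkov_size(expr):
    """(states, transitions) of the position automaton via first/last/follow."""
    positions = count(1)
    follow = {}  # position -> set of positions that may follow it

    def walk(e):
        # Returns (nullable, first, last) and fills `follow` as a side effect.
        kind = e[0]
        if kind == "eps":
            return True, set(), set()
        if kind == "sym":
            p = next(positions)
            follow[p] = set()
            return False, {p}, {p}
        if kind == "star":
            _, first, last = walk(e[1])
            for p in last:
                follow[p] |= first
            return True, first, last
        n1, f1, l1 = walk(e[1])
        n2, f2, l2 = walk(e[2])
        if kind == "alt":
            return n1 or n2, f1 | f2, l1 | l2
        for p in l1:  # cat: last positions of the left part reach first of the right
            follow[p] |= f2
        return n1 and n2, (f1 | f2 if n1 else f1), (l1 | l2 if n2 else l2)

    _, first, _ = walk(expr)
    # One initial state plus one state per position; the label of a transition
    # is fixed by its target position, so (source, target) pairs suffice.
    return len(follow) + 1, len(first) + sum(len(s) for s in follow.values())

def cat(l, r):
    """Concatenation that drops a leading epsilon, so equal states compare equal."""
    return r if l == EPS else ("cat", l, r)

def partial_derivatives(e, a):
    """Set of partial derivatives of e with respect to the letter a."""
    kind = e[0]
    if kind == "eps":
        return set()
    if kind == "sym":
        return {EPS} if e[1] == a else set()
    if kind == "alt":
        return partial_derivatives(e[1], a) | partial_derivatives(e[2], a)
    if kind == "star":
        return {cat(d, e) for d in partial_derivatives(e[1], a)}
    left = {cat(d, e[2]) for d in partial_derivatives(e[1], a)}
    return (left | partial_derivatives(e[2], a)) if nullable(e[1]) else left

def pd_size(expr, alphabet):
    """(states, transitions) of the partial derivative automaton."""
    states, todo, transitions = {expr}, [expr], 0
    while todo:
        src = todo.pop()
        for a in alphabet:
            for dst in partial_derivatives(src, a):
                transitions += 1
                if dst not in states:
                    states.add(dst)
                    todo.append(dst)
    return len(states), transitions

# Example: (a + b)* a
a, b = ("sym", "a"), ("sym", "b")
expr = ("cat", ("star", ("alt", a, b)), a)
print(glushkov_size(expr))  # (4, 9)
print(pd_size(expr, "ab"))  # (2, 3)
```

For this single expression Apd has fewer states and transitions than Apos, which is consistent with Apd being a quotient of Apos; one example says nothing about the averages the paper estimates.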
Similar resources
The Average Transition Complexity of Glushkov and Partial Derivative Automata
In this paper, the relation between the Glushkov automaton (Apos) and the partial derivative automaton (Apd) of a given regular expression, in terms of transition complexity, is studied. The average transition complexity of Apos was proved by Nicaud to be linear in the size of the corresponding expression. This result was obtained using an upper bound of the number of transitions of Apos. Here ...
On the Average State Complexity of Partial derivative Automata: an analytic Combinatorics Approach
The partial derivative automaton (Apd) is usually smaller than other nondeterministic finite automata constructed from a regular expression, and it can be seen as a quotient of the Glushkov automaton (Apos). By estimating the number of regular expressions that have ε as a partial derivative, we compute a lower bound of the average number of mergings of states in Apos and describe its asymptotic...
On the Average Number of States of Partial Derivative Automata
The partial derivative automaton (Apd) is usually smaller than other non-deterministic finite automata constructed from a regular expression, and it can be seen as a quotient of the Glushkov automaton (Apos). By estimating the number of regular expressions that have ε as a partial derivative, we compute a lower bound of the average number of mergings of states in Apos and describe its asymptoti...
Prefix and Right-Partial Derivative Automata
Recently, Yamamoto presented a new method for the conversion from regular expressions (REs) to non-deterministic finite automata (NFA) based on the Thompson ε-NFA (AT). The AT automaton has two quotients discussed: the suffix automaton Asuf and the prefix automaton, Apre. Eliminating ε-transitions in AT, the Glushkov automaton (Apos) is obtained. Thus, it is easy to see that Asuf and the partia...
On the Average Size of Glushkov and Partial derivative Automata
In this paper, the relation between the Glushkov automaton (Apos) and the partial derivative automaton (Apd) of a given regular expression, in terms of transition complexity, is studied. The average transition complexity of Apos was proved by Nicaud to be linear in the size of the corresponding expression. This result was obtained using an upper bound of the number of transitions of Apos. Here ...